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TWO STERILE-20 KINASE-LIKE PROTELNS AND METHODS OF USE THEREOF 

RELATED APPLICATION 

This application claims benefit of U.S. Provisional application 60/069,078, 
filed December 9 ? 1997 the entire teachings of which are incorporated herein by 
5 reference. 

BACKGROUND 

Protein kinases play a key role in cell growth and differentiation. The p2 1- 
activated proteins kinases (PAKs) are related to a yeast serine/threonine protein 
kinase, Ste20. Ste20 is a member of a growing family of regulatory enzymes that 
.10 may play roles in diverse phenomena such as cellular morphogenesis, the stress 
response and the pathogenesis of AIDS. 

The growth, differentiation, maintenance and senescence of cells requires the 
transmission of signals. These signals can be derived from extracellular stimuli, 
such as hormone interaction with its receptor or physiological stress, or may be 
15 derived intracellularly from developmental programs. 

The signals are transmitted via a signal cascade wherein proteins are 
phosphorylated or dephosphorylated in sequence. Protein kinases, the enzymes that 
phosphorylate, play a key role in many signaling pathways and therefore protein 
kinases play a key role in cell growth and differentiation. Protein kinases fall into 
20 two broad categories, tyrosine kinases (those that add phosphate to tyrosine 

residues) and serine/threonine kinases (those that add phosphate to either serine or 
threonine residues). 

One group of signaling pathways involving protein kinases are the mitogen- 
activated protein (MAP) kinase cascades. The MAP kinase cascades are now 
25 recognized to participate in diverse signal transduction pathways rather than only 
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Genetic analyses in yeast have made it possible to identify some of the 
components of MAPK pathways, and to clarify the diversity of their functions and 
regulation (Herskowitz, L. 1995, Cell 80:187-197). Mammalian cells have at least 
three MAPK pathway subtypes, the ERK (extracellular signal-regulated protein 
5 kinase) pathway, JNK (c-Jun NH r terminal kinase)/SAPK (stress-activated protein 
kinase) pathway, and P38/Mpk2 pathway; together, these pathways mediate a wide 
variety of physiological responses (Davis, R.J., 1994, Trends in Biochem. Science 
19:470-473; Derijard v B. eta/., 1994 Cell 76: 1025-1037; Kyriakis, J.M. et aL, 1994 
Nature 369:156-160: Han, J. et aL. 1994 Genes Dev. 3:1336-1348; Lee, J.C. et aL. 

10 1994 Nature 372:739-746; Rouse, J. et aL, 1994 Cell 78:1027-1037). MAPKs are 
activated by sequential protein phosphorylation reactions. The basic framework of 
the MAP kinase pathway, where MAPK is phosphorylated on Thr and Tyr residues 
and activated by MAPK kinase (MAPKK), after which MAPKK is itself 
phosphorylated and activated by MAPKK kinase (MAPKKK), is common from 

1 5 yeast to mammals (Nishida, E. and Gotoh, Y., 1993 Trends in Biochem. Science 
18:128-131; Davis, 1994; Herskowitz. 1995; Marshall, C.J., 1994 Curr. Opux. 
Genet. Dev. 4:82-89). 

A new kinase group activated by G-protein and thought to act as MAPKKK 
kinase (MAPKKKK) has been identified in both yeast and mammals. In budding 

20 yeast, this new kinase group is known by its prototype member Sterile 20 (Ste20). 
Ste20 is activated by the py complex released from the heterotrimeric G protein 
complex upon pheromone receptor stimulation, and in turn activates Stel 1 (a 
MAPKKK) (Leberer. E. et aL. 1992 EMBO J. 1 1:4815-4824; Ramer. S.W. and 
Davis, R.W., 1993 Proa NatL Acad Sci. U.S.A. 90:452-456). In mammals, this new 

2> kinase group is known by its prototype member p21 -activated protein kinase (PAK). 
PAK (now called a-PAK) has been identified as a protein kinase activated by the 
Rho family of small G-proteins. Racl and Cdc42 (Manser, E. et al.. 1994 Nature 
367:40-46); PAK also shows sequence similarity to yeast Ste20. Recently it has 
been clarified that PAK comprises a protein kinase family composed of several PAK 
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iso forms, hPAK65 (Martin. G.A. et aL, 1995 EMBOJ. 14:1970-1978). MPAK-3 
(Bagrodia, S. et al. 1995 J. Biol Chem. 270:2273 1-22737 and 8-PAK (Manser. E. 
et uL. 1995 J. Biol Chem. 270:25070-25078), all of which are able to interact with 
Cdc42 and Racl. Racl and Cdc42 have been implicated not only in cell motility 
5 ( Ridley, A.J. et aL 1992 Cell 70:401-410; Kozma, R. et aU 1995 Mol Cell Biol 
15:1842-1952; Nobes, CD. and Hall, A., 1995 Cell 81:53-62), but also in the 
preferential activation of the JNK/SAPK and p38/Mpk2 pathways rather than the 
ERK pathway (Coso, O.A. et al, 1995 Cell 81 : 1 137-1 146; Minden, A. etal., 1995 
Ce//81:ll47-1157; Olson, M.F. etal.. 1995 Science 269:1270-1272). This is in 

1 0 contrast to another small G-protein, Ras, which predominantly activates the ERX 
pathway through Raf activation (Minden, A. et al, 1994 Science 266:1719-1 723). 
Although a direct interaction between PAKs and components of the JNK/SAPK and 
p3S/Mpk2 pathways has not yet been demonstrated, these observations raise the 
intriguing possibility that PAK or PAK-related proteins mediate the signals from 

15 Racl and Cdc42 to the JNK/SAPK and p38/Mpk2 pathways, and, furthermore, that 
G-proteins differentially regulate MAPK pathways to achieve various physiological 
responses. 

In contrast to the above-mentioned kinases, a group of Ste20-related kinases 
that lack the putative Cdc42/Racl-binding domain has been identified in both yeast 

:0 and mammals: Spsl, an upstream regulator of the MAPK pathways (Creasy, CL. 
and Chernoff, J., 1995 7. Biol Chem. 270:21695-21700). Although the specific 
activation of the SAPK pathway by BCK has recently been reported (Pombo, CM. 
et uL. 1995 Nature 377:750-754), the upstream and downstream signaling pathways 
of this group of kinases remain to be clarified. Sterile 20-related kinases are 

5 regulatory molecules involved in mitogenic signaling as well as other cellular 
phenomena such as morphology and motility. These phenomena are important 
factors in development, cell differentiation, cancer and metastases. Therefore, the 
polynucleotides and polypeptides of the present invention allow manipulation of the 
signaling pathways involved and will allow the development of reagents to modulate 

0 the signaling pathways involved in these important cellular phenomena. 
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SUMMARY OF THE INVENTION 

The present invention relates to isolated polynucleotides encoding novel 
members of the Ste20 family of serine/threonine protein kinases, and the 
recombinantly produced polypeptides encoded by said polynucleotides. 
5 The present invention is drawn to a purified nucleic acid (also referred to 

herein as a polynucleotide) comprising at least 10 continuous nucleotides of a 
nucleic acid sequence provided in SEQ ID NO.: 1 . The present invention is drawn to 
a purified nucleic acid comprising at least 10 continuous nucleotides of a nucleic 
acid sequence provided in SEQ ID NO.: 3. The present invention is also drawn to a 
1 0 purified nucleic acid sequence comprising a nucleic acid sequence encoding at least 
S continuous amino acids of a amino acid sequence provided in SEQ ID NO.: 2. 

Also encompassed by the present invention are the following: a purified 
nucleic acid sequence comprising a nucleic acid sequence encoding at least 8 
continuous amino acids of a amino acid sequence provided in SEQ ID NO.: 4; a 
5 purified nucleic acid comprising a nucleotide sequence which comprises at least 16, 
IS. 19 or 20 nucleotides, hybridizing under stringent conditions, to at least 20 
continuous nucleotides provided in SEQ ID NO. 1: a purified nucleic acid 
comprising a nucleotide sequence which comprises at least 16. IS, 19 or 20 
nucleotides, hybridizing under stringent conditions, to at least 20 continuous 
0 nucleotides provided in SEQ ID NO. 3; a purified nucleic acid comprising SEQ ID 
NO. 1 or SEQ ID NO. 3. 

The present invention also encompasses expression vectors comprising the 
polynucleotides of the present invention and host cells harboring said vectors. The 
present invention is also drawn to a purified nucleic acid comprising a nucleic acid 
5 sequence encoding an amino acid sequence comprising SEQ ID NO. 2 or a nucleic 
acid sequence encoding an amino acid sequence comprising SEQ ID NO. 4, and to a 
method of making polypeptides encoded by SEQ ID NO.l or SEQ ID NO. 3 
comprising, transfecting a host cell with an expression vector comprising SEQ ID 
NO. 1 or SEQ ID NO. 3 and isolating the expressed protein. 
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The present invention is further drawn to polypeptides comprising SEQ ID 
NO. 2 or SEQ ID NO. 4 and biologically active fragments thereof; polypeptides 
functionally equivalent to polypeptides comprising SEQ ID NO. 2 or SEQ ID NO. 4 
or fragments thereof; antibodies that bind to polypeptides encoded by SEQ ID NO.l 
5 and antibodies that bind to polypeptides encoded by SEQ ID NO. 3. 

The present invention is further drawn to a serine/threonine kinase 
comprising the amino acid sequence of SEQ ID NO. 2, and to a serine/threonine 
kinase comprising the amino acid sequence of SEQ ID NO. 4. 

The protein kinase described herein has homology with a family of proteins 
10 that appear to function in the mitogen-activated protein (MAP) kinase cascade. The 
present invention also relates to methods of using the polynucleotides and 
polypeptides described herein to detect, isolate and characterize elements upstream 
and downstream of the novel kinase in the signal transduction pathway using assays 
well known in the art such as kinase assays or co-immunoprecipitation assays, or 
1 5 combinations thereof. Furthermore, polypeptides of the present invention include 
biologically active fragments of the proteins described herein. Such proteins and 
biologically active fragments are useful to generate antibodies that specifically bind 
the proteins of the present invention. The biologically active fragments are also 
useful as tools to study the activity of the protein. Altered forms of the polypeptides 
20 are within the scope of the present invention and can be used to study the activity of 
downstream elements in the signaling pathway or to generate specific antibodies. 

The isolated polynucleotides and polypeptides of the present invention 
provide the advantage of being able to conveniently manipulate the genes, gene 
products and expression level of the gene product, to facilitate understanding of how 
2> Ste2() and Ste20 family members regulate signal transduction in the cell. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts the cDNA sequence encoding C12.2bs (SEQ ID NO. 1). 
Figures 2A and 2B depicts the cDNA sequence encoding Se.new (SEQ ID 

NO. 3). 

Figure 3 depicts the PGR primers (SEQ ID NOs. 5-7) used to clone the novel 
genes of the present invention. 

Figure 4 depicts the amino acid sequence of C12-2bs (SEQ ID NO. 2). 5e. 
new (SEQ ID NO. 4) and S201 (SEQ ID NO. 8), a human Ste20 homologue. 

Figure 5 depicts an amino acid sequence alignment between C12.2bs (SEQ 
ID NO. 2) and 5e.ne\v (SEQ ID NO. 4). 



DETAILED DESCRIPTION OF THE INVENTION 

Ste20/PAK serine/threonine protein kinases (referred to herein as Sterile 20 
protein kinases or Ste20) have been suggested as playing essential roles in Averse 
phenomena such as cellular morphogenesis, the stress response and the pathogenesis 
1 5 of AIDS. Recently, mammalian Ste20 family members have been discovered that 
do not appear to participate in the three known MAP kinase cascades. While much 
is known about the Ste20 family of protein kmases and their interaction with the 
MAP kinase cascade, it is clear that the full extent of the Ste20 family as well as the 
upstream and downstream regulatory components are poorly understood. 
0 Described herein are new members of the Ste20 family that were cloned 

from a murine cDNA library using polymerase chain reaction (PCR). Degenerate 
primers SEQ ID NO. 5. SEQ ID NO. 6 and SEQ ID NO. 7 (Figure 3) were used in 
the following combinations: SEQ ID NO. 5 with SEQ ID NO. 6. and SEQ NO. 5 
with SEQ ID NO. 7, in a standard PCR reaction as described in the Exemplification. 
5 Novel members of the Ste20 family were isolated and are shown in Figures 1 and 2 
(SEQ ID NO. 1 and SEQ ID NO. 3). 

The present invention encompasses the isolated and/or recombinant nucleic 
acid sequences encoding novel members of the Sterile 20 family of serine/threonine 
protein kinases, functional equivalents thereof or biologically active fragments 
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thereof, as described herein. The present invention further encompasses sequences 
complementary to, or homologous with SEQ ID NO. 1 and SEQ ID NO. 3. 

The polynucleotides of the present invention, or portions thereof, can be used 
as probes to isolate and/or clone substantially similar or functionally equivalent 
5 homologues of the Ste20 family of proteins. The polynucleotides of the present 
invention can also be used as probes to detect and or measure expression of the 
genes encoded by the present invention. Expression assays, such as Southern blot 
anaysis and whole mount in situ hybridization are well known in the art. The 
polynucleotides of the present invention, or portions thereof, can be used as primers 
10 to clone homologues or family members by PCR using techniques well known in the 
art. 

As used herein, nucleic acids are also referred to as DNA and RNA, or DNA 
sequences and RNA sequences, or DNA molecules or RNA molecules. Nucleic • 
acids referred to herein as "isolated" are nucleic acids separated away from the 
15 nucleic acids of the genomic DNA or cellular RNA of their source of origin (e.g.. as 
it exists in cells or in a mixture of nucleic acids such as a library), and may have 
undergone further processing. "Isolated" nucleic acids include nucleic acids 
obtained by methods known to those of skill in the ait to obtain isolated nucleic 
acids and methods described herein. These isolated nucleic acids include essentially 
20 pure nucleic acids, nucleic acids produced by chemical synthesis, by combinations 
of biological and chemical methods, and recombinant nucleic acids produced by 
recombinant methods, which are well known in the art. 

Nucleic acids referred to herein as "recombinant" are nucleic acids which 
have been produced by recombinant DNA methodology, including those nucleic 
25 acids that are generated by procedures which rely upon a method of artificial 

recombination, such as the polymerase chain reaction (PCR) and/or cloning into a 
vector using restriction enzymes. "Recombinant" nucleic acids are also those that 
result from recombination events that occur through the natural mechanisms of cells, 
but are selected for after the introduction to the cells of nucleic acids designed to 
allow and make probable a desired recombination event. 
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Also encompassed by the present invention are nucleic acid sequences (DNA 
or RNA sequences) which are complementary, substantially homologous to, or 
functionally equivalent to the Sterile 20 protein kinase DNA sequences described 
herein. Fragments of the sequences, complementary sequences, substantially 
5 homologous sequences and functionally equivalent sequences are also encompassed 
by the present invention. Nucleic acid sequences hybridizing with sequences 
comprising SEQ ID Nos 1 and 3 and portions thereof under conditions of stringency 
known to those of skill in the art to be sufficient to identify DNA sequences with 
substantial nucleic acid sequence identity are also encompassed by the present 
1 0 invention. Due to the degeneracy of the genetic code, different combinations of 

nucleotides can encode for the same polypeptide or the homologous polypeptide in a 
different organism. Thus, different nucleic acids can encode the same Sterile 20 
protein or Sterile 20 homologue. These polynucleotides are referred to herein as 
functionally equivalent polynucleotides. In preferred embodiments, substantially 
L 5 homologous polynucleotides of the present invention comprise at least 1 0. at least 
27, at least 45 or at least 102 continuous nucleotides of SEQ ID NO.: 1 or SEQ ID 
NO.: 3. It is reasonable to predict that DNA sequences identified under such 
stringent conditions will likely encode a protein (also referenced to herein as a 
polypeptide, or peptide fragment) with the biological activity or physical 

20 characteristics of Sterile 20 protein kinases comprising SEQ ID NOs 2 or 4. The 
nucleic acid of the present invention preferably encodes 8. 18, 34 or 54 continuous 
amino acids of the amino acid sequence provided in SEQ ID NO. 2 or the amino 
acid sequence provided in SEQ ID NO. 4. 

Polypeptides with the biological activity or physical characteristics of Sterile 

25 20 protein kinases comprising SEQ ID NOs 2 or 4 are referred to herein as 

functionally equivalent polypeptides. "Functional or biologically active protein" is 
defined herein as a protein which shares significant identity (e.g., at least about 65%, 
preferably at least about 80% and most preferably at least about 95%) with the 
corresponding sequences of the endogenous protein and possesses one or more of 

30 the functions thereof. 
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"Polypeptide" as used herein indicates a molecular chain of amino acids and 
does not refer to a specific length of the product. Thus, peptides, polypeptides and 
proteins are included within the definition of polypeptide. This term is also intended 
to refer to post-expression modifications of the polypeptide, for example, 
5 glycosylations, acetylations, phosphorylations and the like. 

A general description of stringent hybridization conditions are discussed in 
Ausubel, F.M., et aL, Current Protocols in Molecular Biology, Greene Publishing 
Assoc. and Wiley-Interscience 1989. the teachings of which are incorporated herein 
by reference. Factors such as probe length, base composition, percent mismatch 

10 between the hybridizing sequences, temperature and ionic strength influence the 
stability of nucleic acid hybrids. Thus, stringency conditions sufficient to identify 
the polynucleotides of the present invention, (e.g., high or moderate stringency 
conditions) can be determined empirically, depending in part upon the 
characteristics of the known DNA to which other unknown nucleic acids are being 

1 5 compared for sequence similarity. 

Alternatively, conditions for stringency are as described in WO 9S/40404, the 
teachings of which are incorporated herein by reference. In particular, examples of 
highly stringent, stringent, reduced and least stringent conditions are provided in 
WO 98/40404 in the Table on page 36. In one embodiment, highly stringent 

20 conditions are those that are at least as stringent as, for example, lx SSC at 65°C, or 
Ix SSG and 50% formamide at 4TC. Moderate stringency conditions are those that 
are at least as stringent as 4x SSC at 65*C, or 4x SSC and 50% formamide at 42"C. 
Reduced stringency conditions are those that are at least as stringent as 4x SSC at 
50 C. or 6x SSC and 50% formamide at 40°C. 

- 5 As defined herein, substantially complementary means that the sequence need 

not reflect the exact sequence of e.g.. SEQ ID NOs. 1 or 3, but must be sufficiently- 
similar in identity of sequence to hybridize with SEQ ID NOs. 1 or 3 under stringent 
conditions. For example, non-complementary bases, or longer or shorter sequences 
can be interspersed in sequences, provided the sequence has sufficient 

30 complementary bases with, e.g.. SEQ ID NOs. 1 or 3 to hybridize therewith. Under 
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stringent hybridization conditions, only highly complementary nucleic acid 
sequences hybridize. Preferably, such conditions prevent hybridization of nucleic 
acids having four mismatches out of 20 continuous nucleotides, more preferably two 
mismatches out of 20 continuous nucleotides, most preferably one mismatch out .of 
5 20 continuous nucleotides. 

Biological functions of the Sterile 20 kinase proteins include phosphorylation 
of serine and threonine residues in response to extracelluarly derived or 
intracellular^ derived stimuli. Phosphorylation activity can be measured using 
methods well know in the art. For example, the kinase of the present invention can 

0 be epitope- tagged using methods well known in the art. The tagged protein are then 
expressed in suitable cells, such as COS7 cells, using methods well known in the art. 
The cells are then stimulated or not, to activated the Sterile 20 signal transduction 
pathway and lysed under appropriate conditions. The kinase is isolated by 
immunoprecipitation. The kinase activity is measured in. a standard kinase assay 

5 with a suitable substrate, such as myelin basic protein (see, for example. Creasy and 
Chernoff. J, Biol Chem. 270:21695-21700 (1995)). The present invention also 
pertains to an isolated nucleic acid sequence encoding an altered kinase protein, 
wherein the kinase has altered activity. The activity can be enhanced or reduced or 
abolished. The alteration can result in a constitutively active kinase. In one 

0 embodiment, the resulting alteration in amino acid sequence is in the catalytic 
domain of the kinase. 

Biological activity of the present invention further includes the ability to bind 
the normal upstream (for example an activator) or downstream (for example the 
target to be phosphorylated) element in the signaling pathway. Also described 

5 herein, biological activity can include the antigenicity of the protein, or peptide, 
resulting in the production of antibodies which bind to the polypeptides. Sterile 20 
kinases and serine/threonine kinases of the present invention. 

The present invention also relates to methods of altering the biologial activitv 
of the polypeptides of the present invention. Alteration can be. for example, an 

0 increase or decrease in serine/threonine kinase activity, alteration in cellular 
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localization of the protein or alteration in interaction of the polypeptides of the 
present invention with upstream or downstream elements of the signaling pathway. 

The present invention is understood to include the polypeptides having amino 
acid sequences comprising SEQ ID NO. 2 or SEQ ID NO. 4. The present invention 
5 includes polypeptides comprising amino acid sequences analogous to SEQ ID NO. 2 
or SEQ ID NO. 4. Such proteins are defined herein as C12-2bs or Se.new analogs, or 
C12-2bs or 5e.new variants. Analogous amino acid sequences are defined herein to 
mean amino acid sequences with sufficient identity of amino acid sequence with, 
e.g.. C12-2bs or Se.new proteins, to possess the biological activity of Sterile 20 

10 protein kinase. The biological activity of Sterile 20 protein kinase can include, for 
example, the capability to phosphorylate the same target as that phosphoryiated by 
Se.new or C12-2bs such as serine and threonine residues of target proteins in human 
and animal cells in response to extracelluarly derived or intracellular^ derived 
stimuli. For example, an analog polypeptide can be produced with "silent" changes 

IS in the amino acid sequence wherein one, or more amino acid residue differs from the 
amino acid residues of C12-2bs or Se.new, yet still possess the biological activity of 
Se.new or C12-2bs. Examples of such differences include additions, deletions or 
substitutions of residues to e.g., SEQ ID NOs. 2 or 4. Also encompassed by the 
present invention are variant proteins that exhibit lesser or greater biological activity 

20 of the Sterile 20 protein kinases of the present invention. 

Variant proteins of the present invention can be produced using in vitro and 
in vivo techniques well-known to those of skill in the art, for example, site-specific 
mutagenesis and oligonucleotide mutagenesis. Manipulations of the protein 
sequences can be made a the protein level as well. Any numerous chemical 

2:> modifications can be carried out by known techniques including, but not limited to, 
specific chemical cleavage by cyanogen bromide, trypsin and papain. The proteins 
of the present invention can also be structurally modified or denatured, for example, 
by heat. In general, mutations can be conservative or non-conservative amino acid 
substitutions, amino acid insertions or amino acid deletions. 
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For example, DNA encoding a variant protein of the present invention is 
prepared by site-directed mutagenesis of the polynucleotides comprising SEQ ID 
NO. 1 or SEQ ID NO. 3. Site-directed (site-specific) mutagenesis allows the 
production of protein variants through the use of specific oligonucleotide sequences 
5 that encode the DNA sequence of the desired mutation, as well as a sufficient number 
of adjacent nucleotides, to provide a primer sequence of sufficient size and sequence 
complexity to form a stable duplex on both sides of the deletion junction being 
traversed. The term "primer" denotes a specific oligonucleotide sequence 
complementary to a target nucleotide sequence and used to hybridize to the target 

1 0 nucleotide sequence and serve as an initiation point for nucleotide polymerization 
catalyzed by either DNA polymerase, RNA polymerase or reverse transcriptase. 
Typically, a primer of about 20 to 25 nucleotides in length is preferred, with about 5 
to 1 0 residues on both sides of the junction of the sequence being altered. In general, 
the techniques of site-specific mutagenesis are well known in the art. as exemplified 

1 5 by publications such as Edelman et al., DNA 2.T83, 1983. The site-specific 

mutagenesis technique typically employs a phage vector that exists in both a single- 
stranded and double-stranded form. Typical vectors useful in site-directed 
mutagenesis include vectors such as the Ml 3 phage, for example, as disclosed by 
Messing et al.. Third Cleveland Symposium on Macromolecides and Recombinant 

0 DNA. A. Walton, ed., Elsevier, Amsterdam, 1981. This and other phage vectors are 
commercially available and their use is well-known to those skilled in the art. A 
versatile and efficient procedure for the construction of oligonucleotide directed site- 
specific mutations in DNA fragments using M13-derived vectors was published by 
Zoller. M.J. and Smith, M.. Nucleic Acids Res. / 0:6487-6500. 1982. Also, plasmid 

5 vectors that contain a single-stranded phage origin of replication can be employed to 
obtain single-stranded DNA. Veira ei al., Meth Enzymol. 153:3 19S7. Alternatively, 
nucleotide substitutions can be introduced by synthesizing the appropriate DNA 
fragment in vitro, and amplifying it by PCR procedures known in the art. 

In general, site-specific mutagenesis herewith can be performed by first 

0 obtaining a single-stranded vector that includes within its sequence a DNA sequence 
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that encodes the relevant protein. An oligonucleotide primer bearing the desired 
mutated sequence is prepared, generally synthetically, for example, by the method of 
Crea et ai. Proc Nail Acad Sci USA, 75:5765, 1978. This primer can then be 
annealed with the single-stranded protein sequence-containing vector, and subjected 
5 to DNA polymerizing enzymes such as E. coli polymerase I Klenow fragment, to 
complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex is 
formed wherein one strand encodes the original non-mutated sequence and the 
second strand bears the desired mutation. This heteroduplex vector can then be used 
to transform appropriate host cells such as JM 101 cells, and clones can be selected 
10 that include recombinant vectors bearing the mutated sequence arrangement. 
Thereafter, the mutated region can be removed and placed in an appropriate 
expression vector for protein production. 

The PCR technique can also be used in creating amino acid sequence variants 
of the proteins of the present invention. When small amounts of template DNA are 
15 used as starting material in a PCR ? primers that differ slightly in sequence from the 
corresponding region in a template DNA can be used to generate relatively large 
quantities of a specific DNA fragment that differs from the template sequence only at 
the positions where the primers differ from the template. For introduction of a 
mutation into a plasmid DNA, one of the primers can be designed to overlap the 
20 position of the mutation and to contain the mutation; the sequence of the other primer 
is preferably identical to a stretch of sequence of the opposite strand of the plasmid, 
but this sequence can be located anywhere along the plasmid DNA. It is preferred, 
however, that the sequence of the second primer is located within 500 nucleotides 
from that of the first, such that in the end the entire amplified region of DNA 
25 bounded by the primers can be easily sequenced. PCR amplification using a primer 
pair like the one just described results in a population of DNA fragments that differ at 
the end position of the mutation specified by the primer. 

The DNA fragments produced bearing the desired mutation can be used to 
replace the corresponding region in the plasmid that served as PCR template using 
standard DNA technology. Mutations at separate positions can be introduced 
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simultaneously by either using a mutant second primer or performing a second PCR 
with different mutant primers and ligating the two resulting PCR fragments 
simultaneously to the vector fragment in a three (or more) part ligation. 

Another method for preparing variants, cassette mutagenesis, is based on the 
5 technique described by Wells et cil. Gene 34, 3 1 5. 1 985. The starting material can be 
the plasmid (or vector) comprising SEQ ID NO. 1 or SEQ ID NO 3, or a portion 
thereof to be mutated. The codon(s) within the polynucleotide to be mutated are 
identified. There must be unique restriction endonuclease sites on each side of the 
identified mutation site(s). If such restriction sites do not exist, they can be generated 
1 0 using the above-described oligonucleotide-mediated mutagenesis method to 

introduce them at appropriate locations in the polynucleotide. After the restriction 
sites have been introduced into the plasmid, the plasmid is cut at these sites to 
linearize it. A double stranded oligonucleotide encoding the sequence of the DNA 
between the restriction sites but containing the desired mutation(s) is synthesized 
1 5 using standard procedures. The two strands are synthesized separately and then 

hybridized together using standard techniques. This double-stranded oligonucleotide 
is referred to as the cassette. This cassette is designed to have 3' and 5' ends that are 
compatible with the ends of the linearized plasmid. such that it can be directly iigated 
to the plasmid. The plasmid now contains the mutated DNA sequence, that can be 
20 expressed to produce altered proteins with altered kinase, binding or antigenic 
activity. 

The present invention also encompasses biologically active protein, or 
biologically active fragments thereof as described herein. Such fragments can 
include part of the full-length amino acid sequence of a Sterile 20 protein kinase yet ' 
possess biological activity. Such fragments can be produced by amino- and 
carboxyl-terminal deletions, as well as internal deletions. Such peptide fragments can 
be tested for biological activity as described herein. Thus, a functional or biologically 
active protein includes mutants or variants of the endogenous protein wherein one or 
more amino acids have been substituted, deleted or added. The biologically active 
30 fragments can be altered to have enhanced or reduced biological activitv. 



25 
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The biologically active fragments of the protein and polypeptides of the 
present invention comprise at least six continuous amino acids provided in SEQ ID. 
NOs.: 2 and 4. The proteins and polypeptide fragments can be used, for example, 
for structure determination, to assay other molecules' effects on the activity of the 
5 polypeptides of the present invention. These effects can be to inhibit or enhance the 
biological activity or to alter the subcellular location of the protein. The proteins 
and polypeptide fragments of the present invention can further be used to obtain 
antibodies that specifically bind to the polypeptide of the present invention. A useful 
fragment of the polypeptide of interest comprises about 5 to about 35 amino acids of 
1 0 the polypeptides of the present invention. 

The DNA sequences of the present invention can also be used in a 
recombinant construct for the infection, transfection or transformation of a cell in 
vitro or in vivo under control of an appropriate promoter for the expression of 
functional Sterile 20 protein kinases, as defined herein, in an appropriate host cell. 
1 5 Such recombinant constructs are also referred to herein as expression vectors. For 
example, a DNA sequence can be functionally ligated to a suitable promoter (e.g., a 
constitutive or inducible promoter or the endogenous promoter) introduced into a 
suitable expression vector which is then introduced into a suitable host cell. 
Examples of useful promoter sequences include, for example, the early and late 
20 promoters of SV40 or adenovirus, the T3 and T7 promoters, the major operator and 
promoter regions of phage lambda, and the like. Constitutive and inducible promoter 
sequences known to control the expression of genes in prokaryotic or eukaryotic cells 
their viruses or various combinations thereof are useful in the expression of the DNA 
sequences of the present invention.. Suitable host cells for use in expressing the DNA 
25 sequences of the present invention include well known eukaryotic and prokaryotic 
hosts, such as strains of E. coli, Pseudomonas, Bacillus, Streptomyces, insect cells 
such as Spodoptera frugiperda (SF9). animal cells such as CHO and mouse cells, 
African green monkey cells such as COS1 and COS7 and human cells as well as 
plant cells in tissue culture. One of skill in the art may make a selection among the 
vectors expression control sequences and host cells without undue experimentation 
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and without departing from the scope of this invention. The construct can also 
include DNA encoding one or more selectable markers (such as neo, gdhfr and hyg) 
or DNA encoding one or more different antigens or therapeutic proteins. 

The invention also provides vectors containing the serine/threonine kinases of 
5 the present invention and altered forms thereof. Suitable vectors for use in 
eukaryotic and prokaryotic cells are well known in the art and are, generally 
commercially available, or readily prepared by the skilled artisan. For example, 
suitable plasmids for use include pGEX 2T/3X/4T or pET series vectors. Additional 
vectors can also be found in, for example. Ausubel et ai. -'Current Protocols in 
10 Molecular Biology ", John Wiley & Sons, (1998) and Sambrook et ai. 'Molecular 
Cloning: A Laboratoty Manual". 2nd Ed. (1989), the teachings of which are 
incorporated herein by reference. 

The construct can be introduced by any suitable means, as set forth above, 
such as by calcium phosphate precipitation, microinjection, electroporation or 
1 5 infection (such as with an infectious retroviral, herpes vaccinia or adenovirus vector). 
The host cell can be a eucaryotic or procaryotic cell. Suitable cells include bacterial 
( e.g. E. coli) or mammalian cells. Mammalian cells include primary somatic cells, 
such as. epithelial cells, fibroblasts, keratinocytes, macrophages or T cells, or 
immortalized cell lines, such as HeLa or HT1080. The recombinant host cell can 
20 then be cultured and. optionally, selected, m vitro under appropriate conditions 

resulting in the expression of the protein. Alternatively, the cell can be transplanted 
or injected into an animal, such as a human, for in vivo expression. 

The polypeptides and biologically active fragments thereof of the present 
invention may be isolated or purified from recombinant cells or tissue expressing 
25 said polypeptides and purified using any of a variety of conventional methods. These 
methods may include, for example, precipitation, such as ammonium sulfate, ethanol. 
acetone or immuno-precipitation, gel electrophoresis, chromatographic techniques, 
such as normal or reversed phase liquid chromatography. HPLC. FPLC, affinity 
chromatography and size exclusion chromatography or a combination thereof. 
0 -Isolated or purified polypeptide- means a polypeptide of interest or fragment thereof 
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which is essentially free, that is, contains less than about 50%, preferably less than 
about 70%, and more preferably, less than about 90% of cellular components with 
which the polypeptide of interest is naturally associated. Methods for purifying are 
known in the art. One of skill in the art may select the most appropriate isolation and 
5 purification technique without departing from the scope of this invention. 

The present invention further relates to fusion proteins comprising the Sterile 
20 protein kinases described herein (referred to herein as a first moiety) linked to a 
second moiety not occurring in the Sterile 20 protein kinase as found in nature. 
Thus, the second moiety can be a single amino acid, peptide or polypeptide. The first 

1 0 moiety can be in a biologically active fragment of the polypeptide of the present 
invention linked at an N-terminal location, a C-terminal location or to the second 
moiety. The biologically active fragment of the polypeptide of the present invention 
can be fused at both termini to a second moiety. In one embodiment, the fusion 
protein comprises a Sterile 20 protein kinase protein and either a maltose binding 

1 5 protein (MBP) or glutathione-S-transferase (GST). In another embodiment, the 
second moiety is an epitope tag, such as a myc tag or an HA tag. Such fusion 
proteins or epitope-tagged proteins can be isolated using methods well known in the 
art and are useful to produce specific antibodies or can be used in vitro kinase assays. 
Specific antibodies can be used to detect the presence of the polypeptides of 

20 the present invention, fragments thereof or altered forms thereof using standard 

enzyme-linked immunosorbant assay, radioimmunoassay and immunoblot analysis. 
Specific antibodies of the present invention can also be used for immuno- 
cytochemistry on cells or tissues. For example, polyclonal and monoclonal 
antibodies, including non-human and human antibodies, humanized antibodies. 

25 chimeric antibodies and antigen-binding fragments thereof {Current Protocols in 

Immunology: John Wiley & Sons. N.Y. (1994); EP Application 173,494 (Morrison); 
International Patent Application WO86/01533 (Neuberger); and U.S. Patent No. 
5.225.539 (Winters)), which bind to the described polypeptide or altered 
polypeptides or fragments thereof, are within the scope of the invention. 



3NSDOCID: <WO 9929857A 1 



WO 99/29857 



PCT/US98/26116 



■18- 



Antibodies of the present invention can be generated, for example, by 
immunizing a mammal, such as a mouse, rat, hamster or rabbit, with an 
immunogenic form of the polypeptides of the present invention or altered form 
thereof that are capable of eliciting an antibody response). Techniques for conferring 
5 immunogenic^ on a protein or peptide include conjugation to carriers or other 

techniques well known in the art. The protein or polypeptide can be administered in 
the presence of an adjuvant. The progress of immunization can be monitored by 
detection of antibody titers in plasma or serum. Standard ELISA or other 
immunoassays can be used with the immunogen as antigen to assess the levels of 
1 0 antibody. Following immunization, antisera can be obtained, and if desired, 

polyclonal antibodies can be isolated from the serum. Monoclonal antibodies can 
also be produced by standard techniques, which are well known in the art (Kohler 
and Milstein. Nature 256:495-497 (1975); Kozbar et al., Immunology Today 4:12 
(1983); and Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 
5 Inc. pp. 77-96 (1985)). The term "antibody" as used herein is intended to include 
fragments thereof, such as Fab and F(ab),. 

•Antibodies described herein can be used to inhibit the activity of the 
phosphatase described herein, particularly /„ vitro and in cell extracts, using methods 
known in the art. Additionally, antibodies can be detectably labeled, such as with a 
0 radioactive label, and used to assay for the presence of the expressed protein in a 
cell, such as yeast or tissue culture or a tissue sample, and can be used in an 
immunoabsorption process, such as an ELISA, to isolate the polypeptides or 
fragments thereof of the present invention or altered forms thereof. Tissue samples 
which can be assayed include mammalian tissues, e.g.. differentiated and non- 
S differentiated cells : Examples include bone marrow, thymus, kidney, liver, brain, 
pancreas, fibroblasts, epithelium, and muscle. 

The novel family members of Sterile 20. Cl2-2bs (SEQ ID NO. 1) and 
5e.new (SEQ ID NO. 3 ) described herein are useful to study upstream and 
downstream elements in the signal transduction cascades involving these molecules, 
homologues of these molecules or family members. Further, SEQ ID NO. 1 and 
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SEQ ID NO. 3 or portions thereof can be used to isolate and clone homologies of 
these genes from other organisms such as humans, or to clone closely related family 
members in mouse or in other organisms such as humans. SEQ ID NO.: 1 and 3, or 
portions thereof can be used as probes or PCR primers. The proteins encoded by 
5 SEQ ID NO. 2 and SEQ ID NO. 4 can be used in standard phosphorylation assays to 
discover and characterize effectors and substrates of these proteins. 

These signaling molecules participate in a diverse set of events in the cell in 
response to many different stimuli. For example, for this family of kinases, oxidative 
stress can be a regulatory element. These kinases are important for ischemic, stroke, 
heart disease, inflammation, and cancer and are targets for drug therapeutics, both for 
blocking agents as well as agents that enhance kinase activity. 

The present invention will now be illustrated by the following example, 
which are not intended to be limiting in any way. 

EXEMPLIFICATION 
5 CLONING PUTATIVE Ste20 SERINE/THREONINE KINASES 

PCR m conjunction with the primers SEQ ID NOs. 5-7 was used to amplify 
sequences from a murine erythroleukemia cDNA library in A.gtl 1 . In one reaction, 
the primer set was SEQ ID NOs. 5 and 6. and in a separate reaction, the primer set 
was SEQ ID NOs. 5 and 6. In all reactions Taq polymerase was used. 

Sequence alignment revealed that C12-2bs and Se.new share homology with 
the Sterile 20 family of protein kinases (Figure 4). C12-2bs and Se.new represent 
new members of the Sterile 20 family of protein kinases. 

EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more 
than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are intended to be encompassed by the 
following claims. 



WO 99/29857 



PCT/US98/26116 



-20- 



CLA1MS 

What is claimed is: 

I • A purified nucleic acid comprising at least 10 continuous nucleotides of a 
nucleic acid sequence provided in SEQ ED NO.: 1 

5 2. The purified nucleic acid of Claim 1 comprising 27 continuous nucleotides of 
said nucleic acid sequence. 

3. The purified nucleic acid of Claim 1 comprising at least 45 continuous 
nucleotides of said nucleic acid sequence. 



10 



4. The purified nucleic acid of Claim 1 comprising at least 1 00 continuous 
nucleotides of said nucleic acid sequence. 

5- A purified nucleic acid comprising at least 10 continuous nucleotides of; 
nucleic acid sequence provided in SEQ ID NO.: 3. 



6. The purified nucleic acid of Claim 5 comprising at least 27 continuous 
nucleotides of said nucleic acid sequence. 

5 7. The purified nucleic acid of Claim 5 comprising at least 45 continuous 
nucleotides of said nucleic acid sequence. 

S. The purified nucleic acid of Claim 5 comprising at least 100 continuous 
nucleotides of said nucleic acid sequence. 
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9. A purified nucleic acid sequence comprising a nucleic acid sequence 
encoding at least 8 continuous amino acids of a amino acid sequence 
provided in SEQ ID NO.: 2. 

10. The purified nucleic acid of Claim 9 wherein said nucleic acid encodes at 
5 least 1 S continuous amino acids of said amino acid sequence. 

1 1 . The purified nucleic acid of Claim 9 wherein said nucleic acid encodes at 
least 34 continuous amino acids of said amino acid sequence. 

12. The purified nucleic acid of Claim 9 wherein said nucleic acid encodes at 
least 54 continuous amino acids of said amino acid sequence. 

10 13. The purified nucleic acid of Claim 9 wherein said nucleic acid encodes a 
serine/threonine kinase. 

14. The purified nucleic acid of Claim 9 wherein said nucleic acid encodes a 
member of the sterile 20 family of proteins. 

15. A purified nucleic acid sequence comprising a nucleic acid sequence 
1 5 encoding at least 8 continuous amino acids of a amino acid sequence 

provided in SEQ ID NO.: 4. 

16. The purified nucleic acid of Claim 15 wherein said nucleic acid encodes at 
least 1 8 continuous amino acids of said amino acid sequence. 

1 7. The purified nucleic acid of Claim 15 wherein said nucleic acid encodes at 
20 least 34 continuous amino acids of said amino acid sequence. 
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18. The purified nucleic acid of Claim 1 5 wherein said nucleic acid encodes at 
least 54 continuous amino acids of said amino acid sequence. 

19. The purified nucleic acid of Claim 15 wherein said nucleic acid encodes a . 
serine/threonine kinase. 

20. The purified nucleic acid of Claim 15 wherein said nucleic acid encodes a 
member of the sterile 20 family of proteins. 

21. A purified nucleic acid comprising a nucleotide sequence which comprises at 
least 16 nucleotides hybridizing under stringent conditions to at least 20 
continuous nucleotides provided in SEQ ID NO. 1 . 
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A purified nucleic acid comprising a nucleotide sequence which comprises at 
least IS nucleotides hybridizing under stringent conditions to at least 20 
continuous nucleotides provided in SEQ ID NO. 1. 

A purified nucleic acid comprising a nucleotide sequence which comprises at 
least 19 nucleotides hybridizing under stringent conditions to at least 20 
continuous nucleotides provided in SEQ ID NO. 1. 



24. A purified nucleic acid comprising a nucleotide sequence which comprises at 
least 16 nucleotides hybridizing under stringent conditions to at least 20 
continuous nucleotides provided in SEQ ID NO. 3. 

2r>. A purified nucleic acid comprising a nucleotide sequence which comprises at 
20 least 1 8 nucleotides hybridizing under stringent conditions to at least 20 

continuous nucleotides provided in SEQ ID NO. 3. 
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26. A purified nucleic acid comprising a nucleotide sequence which comprises at 
least 19 nucleotides hybridizing under stringent conditions to at least 20 
continuous nucleotides provided in SEQ ID NO. 3. 

27. A purified nucleic acid comprising SEQ ID NO. 1. 
5 28. A vector comprising the nucleic acid of Claim 27. 

29. A host cell comprising the vector of Claim 2S. 

30. A method of making the protein encoded by SEQ ID NO. 1 comprising, 
transfecting a host cell with the vector of Claim 28 and isolating the 
expressed protein. 

10 31. A protein encoded by the nucleic acid of Claim 27. 

32. An antibody that specifically binds to the protein of Claim 31. 

33. A purified nucleic acid comprising SEQ ID NO. 3. 

34. A vector comprising the nucleic acid of Claim 33. 

35. A host cell comprising the vector of Claim 34. 

1^ 36. A method of producing a serine/threonine kinase comprising., transfecting a 
host cell with the vector of Claim 34 and isolating the expressed protein. 

37. A protein encoded by the nucleic acid of Claim 34. 

38. An antibody that specifically binds to the protein of Claim 37. 
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39. A purified nucleic acid comprising a nucleic acid sequence encoding an 
amino acid sequence comprising SEQ ID NO. 2. 

40. A purified nucleic acid comprising a nucleic acid sequence encoding an 
amino acid sequence comprising SEQ ID NO. 4. 

:> 41. A serine/threonine kinase comprising the amino acid sequence of SEQ ID 
NO. 2. 

42. An antibody that specifically binds to the serine/threonine kinase of Claim 41 . 

43 . A serine/threonine kinase comprising the amino acid sequence of SEQ ID 
NO. 4. 

10 44. An antibody that specifically binds to the serine/threonine kinase of Claim 43. 
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C12-2bs. Length: 1504 

1 ctiagtcgggc ctigccgggca tgcagaacct gaaagcagac ccagaagagc 
51 tttttaccaa gctiagagaag attggaaagg gctcttttgg cgaagtgttc 
101 aaaggcattg acaatcggac tcagaaagtg gtggccataa aaatcattga 
151 tctggaagaa gccgaggacg agatagagga catccaacaa gagatcacag 
201 tgctgagcca gtgtgacagt ccctacgcca ccaagtacta tggatcctat 
251 ctcaaggata crtaagttgtg gataatcatg gagtatcttg gtggaggctc 
301 tgccctggat crtgttagagc ctggcccttt agatgaaatt cagattgcaa 
351 ccatattacg agaaattctg aaaggacttg attatctaca ctcggagaag 
401 aaaattcaca gagacattaa agaggccaat gttctgctct ccgaacatgg 
4 51 agaggtgaag cicggcagact, ttggagtggc cggccagctg acggataccc 
501 agatcaaaag gaacaccttc gtgggtaccc ccttctggat: ggcgccggag 
551 gtcatcaagc agtcagccta cgactcaaag gcagacarct ggtcccttgg 
601 catcaccgca atagaactgg ccaaaggaga gc caeca cat tetgagctge 
651 accccatgaa ggtgctattc ctcatcccaa agaacaaccc tcccacactg 
701 gaagggaact; acagcaaacc cctcaaggag ttcgtggagg cct: gectgaa 
751 caaggagccc . agcrttaggc ccactgctaa ggaattattg aagcacaaat 
801 tcataatccg caatgeaaag aaaaegtect acttgaccga gcttatcgac 
851 aggtacaaga ggeggaagge ggagcagagc cacgaggact: ccagctcgga 
901 ggactCtGAC GTGGAGACAG ATGGCCAGGC GTZTGGZ.GGC AGCGACTCTG 
951 GGGACTGGAT CTTCACTATC C GGGAGAAAG AT C C CAAGAA TCTGGAGAAC 
1001 GGAACTCTTC AGCTCTCGGA CTTGGAAAGA AATAAGATGA AAGATATC C C 
105. AAAGAAGC C T TTCTCTCAGT G TTTTATCCC ACAATCATTT CTCCTCCTGT 
1101 TTTGCGGAGC TGAAAAGACA AAGAGC CAAG GCATGCGGAG GGAACTTGGG 
1151 GTCAATA.GAA GAGCTGC GGG GGAGCCATCT ACTTGGCGGA &GZ.GGCCTGC 
1201 CCTGGGA.TCT CAGACACTAT GGTGGCACAG C7TGTGCAGC GGCTGC^GAG 
1251 AIATTCTCTG AGT GGC GGAG GAGCCTCAGC GCACTGAAGG CZCATGGCGC 
1301 CCGGGTTGGT TTTTCCTTTC TTCTTCATCT TCCTTCTTTT TAAAAGTCAA 
1351 CGAGAGCCTT TGCCGACTCT GCGAAGAGGT GTCACGGAGG GGCCCACCCG 
1401 CCCTCCCATA GCGCC&GCAC CTGTCCCTCG TGCCGAATTC CTGCAGCCCG 
1451 GGGGATCCAC TAGTTCTAGA GCGGCCGCC?, CCGCGGTGGA GCTCCAGTTT 

Fig. 1 
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1 GGCACGAGCC CAGGTCCCAG GCACCGCCAC AGGTCAAGCC CTGCATTCAG 
51 GAAAGAGAGC AACACTGCAG TTAGCCAAAA GCCAGGCAGG CGAGCGGCAT 
101 AGAGGCCTCG A.TCGAGAAGC CCGGTAGAGC TGCAGAGATA CCTCCGTAGG 
151 AGGAGCCAGT CTCTGCCGGA GGCGCCACCG CCACCACCGC AGAAGCAGCG 
201 CGAAGTAGCA GTCGCCACCA TGGCCCACTC ACCGGTGGCT GTTCAAGTGC 
251 CTGGGATGCA GAATAAXATA GCAGATCCAG AAGAACTGTT CACAAAATTA 
301 GAGCGCATTG GAAAAGGCTC CTTTGGAGAA GTTTTCAAAG GAATTGATAA 
351 CCGTACTCAG CAAGTGGTTG CAATTAAAAT CAXTGACCTT GAGGAAGCTG 
401 AGGATGAAAX AGAAGACATC CAACAAGAAA TAACTG TTTT gagtcagtgc 
451 GACAGCTCAT ATGTAACAAA ATACTATGGG TCCTATTTAA AGGGTTCAAA 
501 AC TAT GGATA ATAATGGAAT ACCTAGGTGG AGGTTCAGCA TTGGATCTTC 
551 TGCGTGCTGG TCGATTTGAT Gagtrccaga "gccaccat gcticaaqgacr 
601 attttgaaag gtcrggacta tctacattct gaaaagaaaa tccaccgaca 
651 cattaaagct gCcaacgtcr tgctttcaffi. ACAAGGTGAT GTTAAACTGG 
701 CTGACTTTGG AGTTGCTGGC CAGCTGACAG ATACACAAAT CAAAAGAAAC 
751 ACCTTCGTAG GGACTCCGTT Ttggatggct cctgaagrra ttcaacagrc 
801 agcttiatgac tctaaagcrg acatatggtc tttgggaatt actgctattg 
851 aacttgccaa gggagagcct Ccgaattctg acatgcat cc aatgagagtt 
SOI crGTTTCTTA TTC CAAAAAA CAACCCTCCA ACTCTTATTG GAGACTTTAC 
951 TAAGTCTTTC AAGGAGTTTA TTGA7GCTTG CCTGAATAAA GACCCGTCAT 
1001 TTCGTCCTAC AGCTAAAGAA CTTTTGAAGC ATAAGTTCAT CGTAAAAAAT 
1051 TCAAAGAAGA CTTCTTATCT GACTGAA1TG A1CGATCGAT TTAAGAGATG 
1101 GAAGGCAGAA GGCCACAGTG ATGAGGAAXC TGATTCCGAG GGCTCTGACT 
1151 CGGAATCCAG CAGCAGGGAA AGTAACCCTC ACCCTGAATG GAGTTTCAC C 
1201 ACTGTGC GTA AGAAGCCTGA TCCAAAGAAA CTGCAGAATG GGGAAGAGCA 
1251 AGATCTTGTG CAAAC CTTGA GCTGTTTGTC TATGATAAXC ACACCTGCAT 
1301 TTGCCGAACT TAAACAGCAG GAC GAGAAXA ATGCGAGTCG AAACCAGGCA 
1351 ATTGAAGAAC TTGAGAAAAG TATTGCTGTG GCTGAAACCG CCTGTCCTGG 
1401 CATCACAGAT AAGATGGTGA AGAAACTAAT CGAAAAATTT CAAAAGTGTT 
1451 CTGCGGATGA ATCCCCTTAA GAAArCTGTT GTCATTACTT TTGGCTTCTG 
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1501 TTTCATGTGG ACCAGGAGAA ACCCACCAAA GCTATGTCAA CCTTATAAAT 

1551 GCTTAACTCA TGAGCTCCAT GTgccttttg gatctttgcc acattgaaga 

1601 tttagaggaa gctattaaac tattttgtga tggtgattat cattttgtat 

1651 tttaaagaga ttattttgta aggaataatt ttaatactat agttttgccg 

1701 gtattgtagt aaatgctgag atacaggttt tttgtttttt gttttttaat 

1751 tttaggtacC AtTAtlTCtT ATGtTCATgG aATGaATACT G tTTgGt TTg 

1801 GaA.TCt.TTAG TTAACTGTAT ACTCATaAAC ATACAGGTCt TTCAAAGTCA 

1851 TCCTAACTAT TAAATGTtTG TAAATCATCA AGCTtCAAAA agCAtTCttr 

1901 ttCCCCcaca caagtatatt ctaaaaagac tatttgtaat gaggtggaag 

1951 taagtaatac cttcttaaaa cctcgtgcc 



Fig. 2B 
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SEQ ±D No, 



ATA GGA TCC CA(CT) (AC) G (AGCT) GA(AT) a?{acT . 
AA(AG) GG (AGCT) GC (AGCT) AA(CT) JiT (aC T) 
(CT)T 3' 



Q Z? ^ ~ t-\ 



TCG GAA xTC (CT) TC (AGCT) GG (AGCT) GC CAT 
(AC-)TA t " 



SEQ IB No. 



TCG GAA TTC (CT) TC (AGCT) GG (AGCT) GC C*t CC^ 
(AG) AA 3' 



Fia . 3 
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5v 6 

1 50 

12-2 MQ NLKADPEZLF TXLZXIGKG3 FGZVFXGIDN 

5e MAI-IZ PVAVQVFGMQ NNIADPEELF TXLZRIGXGS FGEVFXGIDN 

S201 METVQLR>IF? RRQLKKLDED 3LTXQPEEVF D VL Z X LG Z G 3 YG S v~Y XA I KX 

51 100 
12-2 RTQXWAIXI IDLZZAEDEI EDIQQEITYL SQCDSFYVTX YYG3YLKDTX 
5e RTQQYVAIXI IDLZZAEDEI ZDIQQEITVL SQCDSSYVTX YYG5YLKGSX 
S201 ETGQIVAIXQ V?V . . . ESDL QEIIXZI3IM QQCDSl?r~r,~: YYG3YFKNTD 

101 150 

12-* LWIIMZYLGG GSALDL . . LE FGFLDSigiA TILRZILXGL DYLnSZXXIK 

'5e LWIIMZYLGG GSALDL. . LR AGFFDEFQIA TMLXZILKGL D YL-HS EKXZK 

S201 LWIVM5YCGA GSVSDIIRLR NXTLTEDEIA TILQSTLXGL EYLKTMr.QIX 

i-6-2 FJDIX3ANV-L SZHGZ YXLAD FGVAGQLTDT QIXRNTFVGT ?FWMA?ZVIX 
5e FZ'IX.^r/L-L SEQGDVXLAD FG VAGQ LTIT QIKR2TTFVGT PFWMAFEYIC; 
£2 01 RDIXAGNILL NT ZG KAXLAD FGVAGQLTDT MAXF^TT VI G T FFvvKAPEVIQ 

201 250 
12-2 Q S A. YD S KAD I WSLGITAIZL AXGZPPK3ZL KPMXVLFLI? KNNPFTL-..Z 
5e QSAYDSXAD- W3 LG I TA-Z Z-L AXGZFPMSDH KPMRVLFLI? K2tt??FTL . . I 

S201 eigykcvajjI wslgitaizm aegxppyazi hpmpazfmi? tnfpptfrx? 

251 300 

12-2 GNY3XFLXZF YZAGLNXZF3 FRFTAXZLLX HXFIIRNAXX tsyltelid . 

5e GDFTXZFXZF IDACLNXDFS FRPTAXZZLK HXFIVXMSXX TSYLTELID . 

S201 ZLWSDNFHDF VXQCLVXSPE QRATATQLLQ HPF.VX3AXG Y3ILRDLINZ 

12-2 . ...RYXFL WKAZ.QS F.ZDSSSZZ'Su VZTDGQA3GG 3DSGDWIFTI 

5e . ...F.FXR WXASGKS DZZSDSZG3D SZSSSrJElSX? KPEWSFTTVR 

£2 01 A2^V-XL-XP.QZ AQQRZVDQDD ZZNSZEDZ^fD SGVZ^fr.TKGD ZMGTVRVAST 

12-2 RZXDFX;^ ZZGt LQL3DLZR2TX MXDIFXXFFS 

^5e XXFDFXXL QllGZ EQ DLV 

£201 MSGGAMTMIZ HGDTLPSQLG ^lYINTEDZZ ZiGTMKRRDZ TMQ ? A-XF S F L 

4C 2 _ 450 

12-2 QGr^PQSrLL LFCGAZXTKS QGMFRZLGVN RFJLAGZPSTV7 RXRFALG3QT 

5e QTLSGL3MII TFAFA3 LZQQDZinu ASF2IQAIEEL ZX3IAVA. . 

S201 EYFEQXEXZ^ QZNSFGXNVS G 3 LXN33DWX IFQDGDYZFL X3WTVZDLQX 

451 4 cr . 

12-2 LWWKSLCSGC RDIL. . . . 

5e , . . ETAGPGI TDXMVXXLIZ XFQXCSADZ3 P 

S201 F_LL AL DPMMZQEKZZ I RQ XYF.S XRQ PILDAIZAXX RRQQNF 

Fig . 4 
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C12.2ds 1 &L^S?VXVQV?GMQKNIAD?ZELFT:^ 50 

III ! ! 1 1 1 M I! ! I - M ! 1 1 1 i M I ! I ! M I ! i * i' i 

De.nev 1 MQNLKAD ? EZDFTKL EX ZG KG S ?G EVF KG XDN?.TQ2CW 3 3 

C12.2DS 51 A X K X X DL E EAE DE Z ED Z QQ EI TVL S CCD S S YVTXYY G S YL KG S KLV7I T ME i GO 

. MillMIIMIII!MliijilM!Mi.Mli!lil(|]:.![|M|i 

^e.new 29 AIXj.1DLZ Z AZD E Z ZD I Q Q Z I TVL 3 QC D S ? YVT KYY G S YL X DT KLWZ ZME 8 3 

C12. 2bsl01 Y LG GGS ALDL LEAG ? F D E F Q I AT MLXZ ZLKGL D YLHSEKX XKHD Z KAANV ^ 50 
I Nil I INI I =1 |: II = lllh I = 11 I Ml Mill IN | Mill. || | * 

5e.new ss ylc<;gsaldlle?g?ldezqiatil?^ilkgldylksef^ 13 3 

CI2.2bs 15 i LL S Z QGD VK LADFGVAG Q LTDTQ IXF.NT F VGT F FV/MA? Z VZ Q ^ ^ \ 2 00 

I N hhili Ml IMMIMMMIiMIIMiMMIM.IMMM! 

5 £ . new 1 2 3 LLS E KG ZVX1 ADF GVAGQLTDTQ IFP_^T~GT?FW^\?^7IKQSA;T)SXA 138 
C12-2bs 201 DZWSLGZTAZZIJC<GEF?NSDMK^^^ 25 0 

IIIUMIM-I ifllMIM-llhMMIMIIMM I - =" - 1 - = I M 

5 e . nev 13 3 D^WSLGITAZZI^AKGZPPHSZLH?^ 23 3 

ci2.2bs 251 xdaclnk^psff.ptaxzll^^ i 0Q 

: N!MIMM!lliMilllM|:MM!M!IMI!l(:|!ill! M 

oe.nev 239 V^.CL-£*Z<ZPSFF.PTA^^ 237 
C12.2bs 301 DEESD£EGZDSES3S?^S^7F:-1?EWSFTTv^<K?D??Z:I;QNGEEQ 344 
D e , nev 2 3 3 KEDSSSEDSDVZTSGQASGGSOSGDWIFTZ?^ 337 

C12.2bs 245 DZVQTLSCZS^ZZZPAFAE L XQQDENNAS KNQ AZ E 37=! 

\''- * I-: -II 

Se.nev 35Q ^TF^ZDZFXXPFSQCFZFQSFLL^FC^ 337 

CI2.2DS 330 ELZXEZAVA ETA.CFGZTDfZ^/KZZIEF^QKZS^J3ES? 415 

• : I- |:: • • ! • !"• ! * 
De.nev 333 TvTF^FAI-GSQZL/rWKZLCSGCPZ;-! 413 
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